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TESTING UNIFORMITY ON HIGH-DIMENSIONAL SPHERES 
AGAINST MONOTONE ROTATION ALLY SYMMETRIC ALTERNATIVES 


By Christine Cutting, Davy Paindaveine* and Thomas Verdebout 

Universite litre de Bruxelles 

We consider the problem of testing uniformity on high-dimen¬ 
sional unit spheres. We are primarily interested in non-null issues. 

We show that rotationally symmetric alternatives lead to two Local 
Asymptotic Normality (LAN) structures. The first one is for fixed 
modal location 6 and allows to derive locally asymptotically most 
powerful tests under specified 9. The second one, that addresses the 
Fisher-von Mises-Langevin (FvML) case, relates to the unspecified-# 
problem and shows that the high-dimensional Rayleigh test is locally 
asymptotically most powerful invariant. Under mild assumptions, we 
derive the asymptotic non-null distribution of this test, which allows 
to extend away from the FvML case the asymptotic powers obtained 
there from Le Cam’s third lemma. Throughout, we allow the dimen¬ 
sion p to go to infinity in an arbitrary way as a function of the sample 
size n. Some of our results also strengthen the local optimality prop¬ 
erties of the Rayleigh test in low dimensions. We perform a Monte 
Carlo study to illustrate our asymptotic results. Finally, we treat an 
application related to testing for sphericity in high dimensions. 


1. Introduction. In directional statistics, inference is based on p-variate observa¬ 
tions lying on the unit sphere 5 P_1 := {x 6 M p : ||x|| = \/ x'x = 1}. This is relevant in 
various situations, (i) First, the original data themselves may belong to 5 P_1 ; classical 
examples involve wind direction data (p = 2) or spatial data at the earth scale (p = 3). 
(ii) Second, some fields by nature are so that only the relative magnitude of the observa¬ 
tions is important, which leads to projecting observations onto 5 P_1 . In shape analysis, for 
instance, this projection only gets rid of an overall scale factor related to the (irrelevant) 
object size, (iii) Finally, even in inference problems where the full (Euclidean) observations 
in principle need to be considered, a common practice in nonparametric statistics is to 
restrict to sign procedures, that is, to procedures that are measurable with respect to the 
projections of the observations onto 5 P_1 ; see, e.g., Oja (2010) and the references therein. 

While (i) is obviously restricted to small dimensions p, (ii)-(iii) nowadays increasingly 
involve high-dimensional data. For (ii), high-dimensional directional data were considered 
in Dryden (2005), with applications in brain shape modeling; in text mining, Banerjee et al. 
(2003) and Banerjee and Ghosh (2004) project high-dimensional data on unit spheres to 
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discard text sizes when performing clustering. As for (iii), the huge interest raised by high¬ 
dimensional statistics in the last decade has made it natural to consider high-dimensional 
sign tests. In particular, Zou et al. (2014) recently considered the high-dimensional version 
of the Hallin and Paindaveine (2006) sign tests of sphericity, whereas an extension to the 
high-dimensional case of the location sign test from Chaudhuri (1992) and Mottonen and 
Oja (1995) was recently proposed in Wang, Peng and Li (2015). Considering (iii) in high 
dimensions is particularly appealing since for moderate-to-large p, sign tests show excellent 
(fixed-p) efficiency properties (see Paindaveine and Verdebout (2015) for details). Also, the 
concentration-of-measure phenomenon may make the restriction to signs virtually void as 
the dimension p increases. 

In this paper, we consider the problem of testing uniformity on the unit sphere S p ~ l , 
both in low and high dimensions. In low dimensions, this is a fundamental problem that 
has been extensively treated; see Mardia and Jupp (2000) and the references therein. 
The high-dimensional version of the problem is less standard, yet also has some history. 
Cuesta-Albertos, Cuevas and Fraiman (2009) proposed a test of uniformity that performs 
well empirically even in high dimensions, but no asymptotic results were obtained as p 
goes to infinity. Chikuse (1991, 1993) explicitly considered high-dimensional testing for 
uniformity on the sphere, in a fixed-?! large-p framework, while Cai, Fan and Jiang (2013) 
rather adopted a double asymptotic approach for the same problem. Possible applications 
of testing uniformity on high-dimensional spheres include outlier detection; see Juan and 
Prieto (2001). Other natural applications are related with testing for sphericity in M p , in 
the spirit of (iii) above; in Section 7, we will elaborate on this and provide references. 

To be more specific, assume that the observations form of a triangular array of ran¬ 
dom vectors X n j, i = l,...,n, n = 1,2,..., where, for any n, the X r „;’s are mutually 
independent and share a common distribution on the unit sphere 5 Pn_1 , and consider the 
problem of testing the null hypothesis TLo n that this common distribution is the uniform 
over 5 Pn_1 . While our main interest is in the high-dimensional case (p n -A oo), most of 
our results will also address the (low-dimensional) classical hxed-p case ( p n = p for all n). 
The most classical test of uniformity is the Rayleigh (1919) test, that rejects Hon for large 
values of R n := ??pn||X n || 2 , where X n := ^ X)i=i For fixed p, the test is based on 
the null asymptotic y 2 distribution of R n . In the high-dimensional setup, Paindaveine and 
Verdebout (2015) obtained the following asymptotic normality result under the null. 

Theorem 1.1. Let (p n ) be a sequence of positive integers diverging to oo as n -A oo. 
Assume that the triangular array X n j, i = 1,..., n, n = 1,2,..., is such that, for any n, 
X n i, X n 2 ,..., X nn form a random sample from the uniform distribution on 5 Pn_1 . Then 


( 1 . 1 ) 





as n -A oo, where -A denotes weak convergence. 
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Denoting by <£(•) the cumulative distribution function of the standard normal, the high- 
dimensional Rayleigh test (4>^ n \ say) then rejects 'Hon at asymptotic level a whenever 

(1.2) > z a , with z a := <3? _1 (1 — a). 

Remarkably, this test does not impose any condition on the way p n goes to infinity with n, 
hence can be applied as soon as n and p n are large, without bothering about their rela¬ 
tive magnitude (in contrast, most results in high-dimensional statistics typically impose 
that p n /n —>• c for some c > 0). Theorem 1.1, however, is not sufficient to justify resorting 
to the Rayleigh test : the trivial test, that would discard the data and reject Tfon with 
probability a, has indeed the same asymptotic null behaviour as the high-dimensional 
Rayleigh test, yet has a power function that is uniformly equal to the nominal level a. 
One of the main goals of this paper is to study the non-null behaviour of the Rayleigh test 
and to show that this test actually enjoys nice optimality properties, both in the low- and 
high-dimensional cases. Optimality throughout will be in the Le Cam sense, in relation 
with the Local Asymptotic Normality (LAN) structures of the models we adopt below. 

The outline of the paper is as follows. In Section 2, we define a class of alternatives to 
the null of uniformity that skew the probability mass along a “modal direction” 0. and we 
identify the corresponding contiguous alternatives. In Section 3, we provide a LAN result 
for fixed 6 , which leads to locally asymptotically most powerful tests under specified 0. 
We address the unspecified-# problem through invariance arguments in Section 4, which, 
in the FvML case, provides a second LAN result and shows that the high-dimensional 
Rayleigh test is locally asymptotically most powerful invariant. In Section 5, we derive the 
asymptotic distribution of the high-dimensional Rayleigh test under general rotationally 
symmetric alternatives and comment on the resulting limiting powers. In Section 6, we 
illustrate our asymptotic results through simulations. In Section 7, we link the problem 
considered to that of testing for sphericity in high dimensions and we treat a real data 
example. In Section 8, we summarize the main findings of the paper and discuss some per¬ 
spectives for future reseach. Finally, the appendix and the supplementary article Cutting, 
Paindaveine and Verdebout (2015) collect technical proofs. 

2. Contiguous rotationally symmetric alternatives. Throughout, we consider 
specific alternatives to the null of uniformity over the p-dimensional unit sphere 5 P_1 , 
namely rotationally symmetric alternatives. A p-dimensional vector X is said to be rota¬ 
tionally symmetric about #(£ 5 P_1 ) if and only if OX is equal in distribution to X for any 
orthogonal px p matrix O satisfying O0 = 0 ; see, e.g., Saw (1978). Such distributions are 
fully characterized by the location parameter 0 and the cumulative distribution function F 
of X 7 #. The null of uniformity (under which 0 is not identifiable) is obtained for 

F P (t ) := c p f (1 - s 2 ) (p " 3)/2 ds, with c p := ^( 2 ) 

J-l V 7r F( 2 ) 


( 2 . 1 ) 
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where T(-) is the Euler Gamma function. Particular alternatives are given, e.g., by the 
so-called Fisher-von Mises-Langevin (FvML) distributions, that correspond to 
(2.2) 

(k/ 2)a -1 

v/rrr (E=i)X f _!(«)’ 

where T v {-) is the order-u modified Bessel function of the first kind and k(> 0) is a 
concentration parameter (the larger the value of k, the more concentrated about 0 the 
distribution is); see Mardia and Jupp (2000) for further details. 

In Sections 2 to 4, we actually restrict to “monotone” rotationally symmetric densities 
(with respect to the surface area measure on iS p_1 ) of the form 

(2.3) x i y c PtK jf(nx0), x € S p_1 , 

where d(£ 5 P_1 ) is a location parameter, k(> 0) is a concentration parameter, and 
the function / : M —> M + is monotone strictly increasing, differentiable at 0, and sat¬ 
isfies /(0) = /'(0) = 1. These conditions on /, that will be tacitly assumed throughout, 
guarantee identihability of 6, k and / : clearly, the strict monotonicity of / implies that 6 
is the modal location on 5 P_1 , whereas the constraint f'( 0) = 1 allows to identify n n 
and /. Note that irrespective of /, the boundary value k = 0 corresponds to the uniform 
distribution over S p ^ 1 . It is well-known that, if X has density (2.3), then X'0 has den¬ 
sity t i-A Cp )K j( 1 — t 2 )( p_3 )/ 2 /(Kf) I[t 6 [—1,1]] (throughout I [A] stands for the indicator 
function of the set or condition A). This is compatible with the cumulative distribution 
functions in (2.1)-(2.2), and shows that Cp jK j = l/(/j L 1 (l — i 2 )(p -3 )/ 2 /(«£) dt ). Finally, 
note that /(•) = /fvMl(') = exp(-) provides the FvML distributions above. 

As announced in the introduction, we consider triangular arrays of observations X m , 
i = 1,..., n, n = 1,2,... where the random vectors X n j, i = 1,..., n take values in 5 Pn_1 . 
More specifically, for any 0 n e 5 Pn_1 , n n > 0 and / as above, we will denote as Pg^ K j the 
hypothesis under which X„,, i = 1,..., n are mutually independent and share the common 
density x i —> c PntKn jf(K n x'0 n ). Note that larger values of K n provide increasingly severe 
deviations from the null of uniformity, which is obtained as K n goes to zero. Denoting the 

(n) 

null hypothesis as Pg , it is then natural to wonder whether or not “appropriately small” 
sequences K n make Pg’ 1 f . f and Pg""* mutually contiguous. The following result answers 
this question (see Appendix A for a proof). 

Theorem 2.1. Let (p n ) be a sequence in {2,3,...}. Let (6 n ) be a sequence such that 
0 n e S p ' 1 ^ 1 for all n, (n n ) be a positive sequence such that = O(^), and assume that f 

(n) 

is twice differentiable at 0. Then, the sequence of alternative hypotheses P^ K ^ and the 
null sequence Pq^ are mutually contiguous. 

This contiguity result covers both the low- and high-dimensional cases. In the low¬ 
dimensional case, the usual parametric rate K n ~ 1/y^ provides contiguous alternatives, 
which implies that, irrespective of /, there exist no consistent tests for TLo n ■ {Pg } 


V') 



TESTING UNIFORMITY ON HIGH-DIMENSIONAL SPHERES 


5 


against TL\ n : {Pi re ^ ,} if n n = t/ Jn, r > 0. The high-dimensional case is more interest- 

u i^n ij J ' 

ing. First, we stress that the contiguity result in Theorem 2.1 does not impose conditions 
on p n , hence in particular applies when (a) p n /n —> c for some c > 0 or (b) p n /n — > oo. 
Interestingly, the result shows that contiguity in cases (a)-(b) can be achieved for se¬ 
quences (k„) that do not converge to zero : a constant sequence ( K n ) ensures contiguity 
in case (a), whereas contiguity in case (b) may even be obtained for a sequence («„,) that 
diverges to infinity in a suitable way. In both cases, there then exist no consistent tests 
for T~Lq n : {P-; 0 } against the corresponding sequences of alternatives H\ n '■ {P^ K y}, de¬ 
spite the fact that the sequences (n n ) are not o(l). This may be puzzling at first since such 
sequences are expected to lead to severe alternatives to uniformity; it actually makes sense, 
however, that the fast increase of the dimension p n , despite the favorable sequences (n n ), 
makes the problem difficult enough to prevent the existence of consistent tests. 


3. Optimal testing under specified modal location. Whenever the modal loca¬ 
tion 0 n is specified (a case that is explicitly treated in Mardia and Jupp, 2000), optimal 
tests of uniformity can be obtained from the following Local Asymptotic Normality (LAN) 
result (see Appendix A for a proof). To the best of our knowledge, this result provides the 
first instance of the LAN structure in high dimensions. 


Theorem 3.1. Let ( p n ) be a sequence in {2,3,...} and let (6 n ) be a sequence such 
that 0 n € 5 Pn_1 for all n. Let n n = T n y/p n /n , where the positive sequence (r n ) is 0(1) but 
not o(l), and assume that f is twice differentiable at 0. Then, as n —> oo under Pq H \ 


(3.1) 


log 


dP 


(n) 


dP, 


(n) 


= TV, A, 


(n) 




where Ag 1 ' 1 := y/np n is asymptotically standard normal. In other words, the model 

f • K — 0} ( w here P^ 0 j := Pq^ for any 6 n and f) is locally asymptotically normal 
at k = 0 with central sequence Ag , Fisher information 1, and contiguity rate yjpn/n. 


This result, that covers both the low- and high-dimensional cases, reveals that the 
rate n n ~ y/pn/n in Theorem 2.1 is actually the contiguity rate of the considered model 
(that is, more severe alternatives are not contiguous to the null of uniformity). In low 
dimensions, the usual parametric contiguity rate n n ~ l/v^ is obtained. The high- 
dimensional rate is of course non-standard. Yet in the FvML high-dimensional case, this 
rate may be related to the fact that, as p —> oo, one needs to consider n p ~ ^Jp to obtain 
FvML p- vectors that provide non-degenerate weak limiting results that are different from 
those obtained from p- vectors that are uniform over the sphere (see Watson (1988) for a 
precise result); the contiguity rate n n ~ y/pn/n then intuitively results from a standard 
l/\/n-shrinkage starting from this non-trivial K p ~ ^fp high-dimensional situation. 

Now, consider the specified-$ n problem, that is, the problem of testing {Pg^} (unifor¬ 
mity over N p " _1 ) against U k >oU/{P^ k . j }. Theorem 3.1 entails that the test <pg l> rejecting 
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the null at asymptotic level a whenever 

(3.2) A ^ = y/np^X.' n 6 n > z a 

is locally asymptotically most powerful. Since Le Cam’s third lemma readily implies that 
Ag 1 ' 1 is asymptotically normal with mean r and variance one under Pq ’ 1 K p with K n = 

r \JPn/n, the corresponding asymptotic power of (fg 1 ' 1 is 
( 3 ' 3 ) = 1 - ^ " T )- 

While all results of this section so far covered both the low- and high-dimensional cases, 
we need to treat these cases separately to investigate how the Rayleigh test compares with 
the optimal test cjr e ; . 

We start with the low-dimensional case. Denoting by Xp(^) the non-central chi-square 
distribution with p degrees of freedom and non-centrality parameter 5, Le Cam’s third 
lemma allows to show that, under the contiguous alternatives Pg‘ 1 f . p with K n = r^/p/n 
(compare with the local alternatives from Theorem 3.1), 

(3.4) R n 4 Xp(r 2 ) 

as n —> oo; for the sake of completeness, we provide a proof in the supplementary article 
Cutting, Paindaveine and Verdebout (2015). Denoting by T p (-) the cumulative distribution 
function of the \p distribution, the corresponding asymptotic power of the Rayleigh test 
is therefore 

(3.5) n lim P ( £ KnJ [Rn > - «)] = p [Y > «r- 1 (l - a)], with Y ~ X 2 (r 2 ), 

which is strictly smaller than the asymptotic power in (3.3). We conclude that, in the 
specified-0 ra case, the low-dimensional Rayleigh test is not locally asymptotically most 
powerful yet shows non-trivial asymptotic powers against contiguous alternatives. 

The story is different in the high-dimensional case, as it can be guessed from the fol¬ 
lowing heuristic reasoning. In view of (3.4), we have that, as n A oo under Pq'\. p 
with K n = r^/p/n, 

St Rn ~P v xl(r 2 ) - 1 Xp-l ~{p~ 1) 

1,1 v/2 p V2P v^P ’ 

where both chi-square variables are independent. When both n and p are large, it is 
therefore expected that, under the same sequence of alternatives, R^f ~ A^(^fp > 1 + ^r)> 
where Z n ~ £ means that the distribution of Z n is close to C. Thus, in the high-dimensional 
case (where p = p n oo), R^f is expected to be standard normal under these alternatives, 
which would imply that the high-dimensional Rayleigh test in (1.2) has asymptotic powers 
equal to the nominal level a. 
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The high-dimensional LAN result in Theorem 3.1 allows to confirm these heuristics. 
Letting n n = T n ^Jp n jn, where r n is 0(1), Theorem 3.1 readily yields that, as to —> oo, 


Cov (n) 


jpLO 

i^Mog 9n ’ Kn ’ f 


dP, 


(n) 

0 


= CoV p (n)[Rl\ A ^j\ T„ + o( 1) 


Vzpv 


TV 


3/2 


E E E pM [(x' m 0 n )(x; fc x„ £ )] + o (i) = o (i), 


i= 1 1 <k<i<n 


so that Le Cam s third lemma implies that /?.)) remains asymptotically standard nor- 

(n) 

mal under P), This confirms that, unlike in the low-dimensional case, the high- 

dimensional Rayleigh test does not show any power under the contiguous alternatives 
from Theorem 3.1. In other words, the high-dimensional Rayleigh test fails to be rate- 
consistent for the specified-#™ problem. 

The Rayleigh test, however, does not make use of the specified value of the modal 
location #™, hence does not primarily address the specified-#™ problem but rather the 
unspecified-#™ one. Therefore, the key question is whether or not the Rayleigh test is 
optimal for the latter problem. We answer this question in the next section. 


4. Optimal testing under unspecified modal location. Building on the results 
of the previous section, two natural approaches, that may lead to an optimal test for 
the unspecified-#™ problem, are the following. The first one consists in substituting an 
estimator #„ for #„ in the optimal test d>i" * above. For the so-called spherical mean #„ = 
X n /||X n || (which is the MLE for #™ in the FvML case), the resulting test rejects the null 
for large values of = ^Jnp n X),#™ = ^ /np n ||X n || = R)/ 2 , hence coincides with the 
Rayleigh test. The second approach, in the spirit of Davies (1977, 1987, 2002), rather 
consists in adopting the test statistic sup 0iigiSPri -i Ag = ^ Jnp n ||X n ||, which again leads 
to the Rayleigh test. These considerations suggest that the Rayleigh test indeed may be 
optimal for the unspecified-#™ problem. In this section, we investigate whether this is the 
case or not, both in low and high dimensions. 


4.1. The low-dimensional case. To investigate the optimality properties of the low- 
dimensional Rayleigh test for the unspecified-#™ problem, it is helpful to adopt a new 
parametrization. For fixed p and /, the model is indexed by (#, k) G S p 1 x R + , where the 
value k = 0 makes # unidentified (for fixed p , the dimension of # does not depend on to, 
so that there is no need to consider sequences (#™)). We then consider the alternative 
parametrization in p, := kO, for which the fixed-p result in Theorem 3.1 readily rewrites 
as follows. 


Theorem 4 . 1 . Fix an integer p > 2 and let p n = y/pjnr n for all n, where the 
sequence ( r n ) in M p is 0 ( 1 ) but not o ( l ). Assume that f is twice differentiable at 0 . For 
any p € M. p \ {0}, let P^j := ^ p where p =: k #, with 6 € 5 P_1 . Then, as n -> oo 

under Pq ”\ log (dP^ ^/ dP ^) = r' n A (n ^ — ^|| t „|| 2 + op ( l ), where A*-' 1 -* := y/rvp~K n is 
asymptotically standard p-variate normal. 











CHR. CUTTING, D. PAINDAVEINE, AND TH. VERDEBOUT 


In the new parametrization, note that the problem of testing uniformity consists in 

(n) (n) 

testing Wq : g = 0 versus 'H 1 : g 7 ^ 0. Theorem 4.1 then ensures that the test rejecting 

the null at asymptotic level a whenever ||A^ n '|| 2 = ?rp||X „|| 2 > 4' p 1 (l — a) — that is, 
the low-dimensional Rayleigh test — is locally asymptotically maximin; see, e.g., Liese 
and Miescke (2008). This new optimality property of the low-dimensional Rayleigh test 
complements the one stating that this test is locally most powerful invariant; see, e.g., 
Chikuse (2003), Section 6.3.5. 

The specified-#,, and unspecified-#,, testing problems are two distinct statistical prob¬ 
lems, that, even in the low-dimensional case considered, provide different efficiency bounds. 
In low dimensions, the Rayleigh test is optimal for the unspecified-#,, problem, but not for 
the specified-#,, one (the latter suboptimality follows from the fact that the asymptotic 
powers in (3.5) are strictly smaller than those of the optimal test in (3.3)). This thoroughly 
describes the optimality properties of this test in the low-dimensional case, so that we may 
now focus on the high-dimensional case. 

4.2. The high-dimensional case. If p n goes to infinity, then the dimension of the pa¬ 
rameter (#„, k) increases with n, so that there cannot be a high-dimensional analogue 
of the LAN result in Theorem 4.1. We therefore rather adopt, in the present hypothesis 
testing context, an invariance approach that is close in spirit to the one used by Moreira 
(2009) in a point estimation context. 

The null of uniformity and all collections of alternatives : = {P^)( ^ : # E 5 p,1_1 } 
(hence also the problem of testing uniformity against rotationally symmetric alterna¬ 
tives itself) are invariant under the group of rotations := {g^ : O E SO(p n )}, 
where ^(xi,... ,x„) = (Oxi,..., Ox„) for any (xi,...,x„) E S Pri_1 x ... x 
(n times) and where SO(jp n ) stands for the collection of p n x p n orthogonal matrices with 
determinant one. The invariance principle (see, e.g., Shao (2003), Section 6.3, or Lehmann 
and Romano (2005), Chapter 6) then suggests restricting to invariant tests, that 
automatically are distribution-free under any . 

As usual, optimal invariant tests are to be determined in the image of the original 
model by a maximal invariant T„ of Q^ n \ The likelihood (with respect to the surface area 
measure m Pn on 5 Pn_1 ) associated with the image of T’^'j j by T n is given by 



where the integral is with respect to the Haar measure on SO(p n ); see, e.g., Lemma 2.5.1 
in Giri (1996). The resulting log-likelihood ratio to the null of uniformity is therefore 

, C Pn,K„,/ IsO(p n ) n?=l /(KnX m (0'#„)) do 

lQ g - Tn - 

Pn 

, nr =1 /(^xcu)|x nl ,... ,x nn ] 

lQ g -At-> 

Pn 


A nJ ■= lQ g 


dP 


(n)T„ 

K n,f 


dP 


(n) 


(4.1) 
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where U is uniformly distributed over and is independent of the X m ’s. The following 

theorem shows that, in the FvML case /(•) = /fvMl(') = exp(-), this collection of log- 
likelihood ratios enjoys the LAN property. 


Theorem 4.2. Let (jp n ) be a sequence of positive integers diverging to oo as n —> oo 
and let n n = T n p^J A /y/n, where the positive sequence (r n ) is 0(1) but not o(l). Then, 
as n —> oo under Pq , we have that 


(4.2) 


jp( n )Tn 

KhJfvML _ J2 a (n)T„ 

dP<"> “ " 


+ Op(l), 


where A* n ) Tn := R^)/y/2 is asymptotically normal with mean zero and variance 1/2 
(Rf) is the standardized Rayleigh test statistic in (1.1)). 


Applying Le Cam’s third lemma, we obtain that, as n —> oo under P*" ML ) with K n = 
rpn^/y/n, A( n ) Tn converges weakly to the normal distribution with mean Tt 2 and vari¬ 
ance T, with T = 1/2. The model {P^j-J^ : k > 0} (where Po/jf^L := ^*o^) thus 
“second-order” LAN, in the sense that the mean of the limiting Gaussian shift exper¬ 
iment is quadratic (rather than linear) in r. Clearly, this does not change the form of 
locally asymptotically optimal tests, but only their asymptotic performances. Note that 
the contiguity rate n n ~ pff /4 /yfn associated with this new LAN property differs from the 
contiguity rate n n ~ \fpnfn in Theorem 3.1. 

Theorem 4.2 entails that the test rejecting the null of uniformity at asymptotic level a 
whenever A( n ) Tn /\/r = R,f) > z a (that is, the high-dimensional Rayleigh test in (1.2)) is, 
in the FvML case, locally asymptotically most powerful invariant , that is, locally asymp¬ 
totically most powerful in the class of invariant tests. This optimality result is of a high¬ 
dimensional asymptotic nature and also covers cases where K n does not converge to 0, hence 
does not follow from the aforementioned local optimality result from Chikuse (2003). Le 
Cam’s third lemma readily implies that Rf) converges weakly to the normal distribution 
with mean T 2 /y /2 and variance one as n —> oo under P^^ ML , with n n = rpn^/y/n, so 
that the corresponding asymptotic power of the Rayleigh test is given by 

= 1 - »(*> - 


where the sequence (9 n ) is such that 0 n E S Pn ~ l for all n but is otherwise arbitrary. While 
the Rayleigh test is blind to alternatives in n n ~ yjpn/n, it thus detects alternatives 
in n n ~ pf/ 4 / yfn, which, in view of Theorem 4.2, is the best that can be achieved for the 
unspecified- 0 .„ problem. 

Interestingly, we might have guessed that these alternatives in n n ~ Pr / 4 /\fn are those 
that can be detected by the high-dimensional Rayleigh test. Recall indeed that heuristic 
arguments in Section 3 suggested that, under Pq 1> t , p with K n = Ty/pjn, the distribution 
of Rff is close to AA(^^ , 1 + =)—) for large n and p. Consequently, to obtain, in high di¬ 
mensions, an asymptotic non-null distribution that differs from the limiting null (standard 
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normal) one, we need to consider alternatives of the form Pi^ with n n = rrv / 4 /\/n, 
under which the distribution of is then expected to be approximately , l) for 

large n and p. This is fully in line with the non-null distribution and local asymptotic 
powers obtained from Le Cam’s third lemma in the previous paragraph. 

Provided that / is four times differentiable at 0 and that p n = o(n 2 ), tedious computa¬ 
tions allowed to show that a fourth-order expansion of the /-based log-likelihood ratio 
above, still based on K n = T n p n /\/n, exactly provides the righthand side of (4.2), with 
the same central sequence A^ n l Tn . However, turning this into a proper /-based version 
of Theorem 4.2 requires controlling the corresponding (fifth-order) remainder term, which 
proved to be extremely difficult. Yet we conjecture that Theorem 4.2 indeed extends 
to an arbitrary / admitting five derivatives at 0 , under the aforementioned assumption 
that p n = o(n 2 ) (an assumption that is superfluous in the FvML case, since Theorem 4.2 
allows p n to go to infinity in an arbitrary way as a function of n). Proving this conjecture 
would establish that the Rayleigh test is locally asymptotically most powerful invariant 
under any such /, with the same asymptotic powers as in (4.3). Since this remains a con¬ 
jecture, we now study the asymptotic powers of the high-dimensional Rayleigh test away 
from the FvML case. 

5. Asymptotic non-null behaviour of the Rayleigh test. In this section, we 
derive the asymptotic distribution of the high-dimensional Rayleigh test under rotation- 
ally symmetric distributions that encompass those considered in Sections 2-4. Here we 
do not require that the rotationally symmetric alternatives are monotone (in the sense 
of Section 2), nor absolutely continuous with respect to the surface area measure on the 
unit sphere, nor that they involve a concentration parameter k. Yet one of our objec¬ 
tives is to interpret the results of this section in the light of the contiguity/LAN/rate- 
consistency/power results obtained above. 

More specifically, the sequences of alternatives we consider in this section are described 
by triangular arrays of observations X n j, i = 1,..., n, n = 1,2,... such that, for any n, 
X n i, X„ 2 , • • •, X nn are mutually independent and share a common rotationally symmetric 
distribution on 5 Pn_1 . We denote by P ^"' 1 F the corresponding hypothesis when X n! ; is 
rotationally symmetric about 0 n and X' ni 0 n has cumulative distribution function F n . Since 
the Rayleigh test statistic is invariant under rotations, we will, without loss of generality, 
restrict to the case for which 0 n , for any n, coincides with the first vector of the canonical 
basis of M Pn . The corresponding sequence of hypotheses will then simply be denoted as P// . 

Under the null of uniformity (which we still denote as Pq^), the test statistic Rff in (1.1) 
has mean zero and variance p /4 l (—> 1). Rotationally symmetric alternatives are expected 
to have an impact on the asymptotic mean and variance of This is made precise in 
the following result (see Appendix B.l for a proof). 

Proposition 5.1. Under P^/\ E[/?,(/] = (n — l) v /p^e 2 1 /v / 2 and cr 2 := p n e 2 2 + 
2 np n e 2 1 e n 2 + / 2 2 = VarfR^] + o(l) as n —>• oo, where the expectations e n e '■= E[(X' m 0„) f ], 
e nl := E[(X' ni 0 n - e n i) 1 ], and f ni := E[(l - (XU0 n ) 2 ) £ / 2 ] are taken under P^. 
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Under Pg , e n \ = 0 and e n2 = e n 2 = 1 /p n , so that Proposition 5.1 is compatible 
with the null values of Efi?®*] and Varfi?®*] provided above. Now, parallel to the null case 
(see Theorem 1.1), the Rayleigh test statistic, after appropriate standardization, is also 
asymptotically standard normal under a broad class of rotationally symmetric alternatives. 
More precisely, we have the following result (see Appendix B.2 for a proof). 

Theorem 5.1. Let (p n ) be a sequence of positive integers diverging to oo as n — >• oo. 
Assume that the sequence (P? ) is such that, as n oo, (i) min ( Pn t n2 , ) = oil), 

(a) e n4 /e ^2 = °{n ) and (Hi) fn*/f %2 = °( n )- Then, under P^f, - E [Rff])/a n = 
El <i<j<n ( x ni x «i - 4i) ” AT(0,1) as oo. 

This result applies under very mild assumptions, that in particular do not impose ab¬ 
solute continuity nor any other regularity conditions. The only structural assumptions are 
the conditions (i)-(iii) above. These, however, may only be violated for rotationally sym¬ 
metric distributions that are very far from the null of uniformity (hence, for alternatives 
under which there is in practice no need for a test of uniformity). Indeed, a necessary — 
yet far from sufficient — condition for (i)-(iii) to be violated is that ~X! nl 6 n converges in 
probability to some constant c(e [—1,1]). Moreover, in the FvML case, the conditions (i)- 
(iii) always hold, that is, they hold without any constraint on the concentration n n nor 
on the way the dimension p n goes to infinity with n (the proof of this statement is very 
lengthy and requires original results on modified Bessel functions ratios, hence is provided 
in the supplementary article Cutting, Paindaveine and Verdebout (2015)). 

Theorems 5.1 allows to compute the asymptotic power of the Rayleigh test under 
appropriate sequences of alternatives. As mentioned above, the null of uniformity Ro n 
yields e n i = 0 and e n2 = 1 /p n - Here, we therefore consider “local” departures from uni¬ 
formity of the form R\ n : {P^ : e n \ = 0 + u n T, e n2 = ( 1 /Pn) + Cnh}' The following 
result provides the asymptotic power of the high-dimensional Rayleigh test in (1.2) under 
sequences of local alternatives that, as we will show, are intimately related to those we 
considered in Sections 3-4 (see Appendix B.2 for a proof). 


Theorem 5.2. Let (p n ) be a sequence of positive integers diverging to oo as n 
Let the sequence (P^ *) satisfy the assumptions of Theorem 5.1 and be such that 


oo. 


(5.1) 


€nl — 


n 1 / 2 ^ 4 


+ o 


n 1 / 2 pl/ 4 


and 


e n 2 = -b o — 

Pn \p n 




(n) 

for some r > 0. Then, under P F f, the asymptotic power of the high-dimensional Rayleigh 
test in (1.2) is given by 1 — — ^=). 


In order to link these alternatives to those considered earlier, note that, as n —> oo 
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under P^ n K ^ with n n = £ n \fpnfn , where the positive sequence (£ n ) is o(y/n), we have 

— / (1 - s 2 )^ 3 ^ 2 K n sf(K n s)ds 
Kn J — 1 


^nl — 


C Pn,Kn,f 


(5.2) 

and 


= l + ^/"( 0 ) + oP 


2p 


Pn 


-1 


. I - ' ii- 

-ho — 


Pn 


Pn 


Cn2 — 


-'Pn 


C Pn,Kn,f 


) % [ (1 -S 2 ) (Pn 3)/2 (K n s) 2 f(n n s)ds 

J «n 7-1 


(5.3) 


= l + ^/"( 0 )+o 


2 p 


/C 


Pn 


-1 


—+ 0 — 


Pn 


Pn 


where we used four times Lemma A.l. For the contiguous alternatives in Theorem 3.1, 
>M 

On,KnJ' 


that is for .. f , with n n = T n yjp n /n (where (r n ) is bounded), (5.2)-(5.3) provide 


(5.4) 


T n 


C-nl — 


+ 0 


\sjrvp^ 


1 


and 


1 


&n2 =-hO - 


Pn 


1 


Pn 


Theorem 5.2 implies that the asymptotic power of the high-dimensional Rayleigh test 
under the alternatives (5.4) is equal to a, which confirms (see Section 3) that this test is 
blind to contiguous alternatives. 

Now, at least if p n = o{n 2 ) (a constraint that is actually superfluous in the FvML case, 
as it can be seen by using the Amos-type bounds provided in Lemma S.3.2 from Cutting, 
Paindaveine and Verdebout (2015)), the more severe alternatives Pi ”" 1 ,, with K n = 

Tp n /\/n, from Theorem 4.2 translate — still in view of (5.2)-(5.3) — into those in (5.1). 
This shows that the asymptotic powers of the high-dimensional Rayleigh test computed 
in the FvML case via Le Cam’s third lemma (see (4.3)) actually also hold away from the 
FvML case. Clearly, this further supports the conjecture from Section 4.2 that, under the 
assumption that p n = o(n 2 ), Theorem 4.2 holds for an essentially arbitrary /. 

6 . A Monte Carlo study. In this section, we present the results of a Monte Carlo 
study we conducted to check the validity of our asymptotic results. We performed two 
simulations. In the first one, we generated independent random samples of the form 


( 6 . 1 ) 


X 




* = 1,..., n, J = 1,2, £ = 0,1, 2, 3,4. 


(£) . 

For l = 0, the common distribution of the X}.,”s is the uniform distribution on the unit 

L iJ 

sphere 5 P_1 , while, for i > 0, the X^-’s have an FvML distribution on with location 




0 = (1,O,...,O)'g 


and concentration k^\ with 


M - 


= 0 . 6 £ a/ — 

n 


and tSp = 0 . 6 £ ^ 


,3/4 


n 
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In the second simulation, we considered again independent random samples of the form (6.1), 
still with ! s that are uniform over 5 P_1 . Here, however, the X.-^-’s, for l = 1,2, 3, 4, are 
rotationally symmetric with location 0 = ( 1 , 0 ,..., 0 ) / £ M p and are such that the 0'X^J’s 
are beta with mean ejf^ and variance e 2 -j = 1/p, where we let 


JQ = 

1:1 y/n.V 


and 



0 . 6 £ 

n !/2pl/4 


(this beta example is associated with a non-monotonic nuisance /, which is allowed in 
Section 5). In both simulations, the value l = 0 corresponds to the null hypothesis of 
uniformity, while l = 1,2, 3,4 provide increasingly severe alternatives. The case j = 1 
relates to the contiguous alternatives (see Theorem 3.1) and the corresponding (more 
general) alternatives in (5.4), whereas j = 2 is associated with the alternatives under which 
the Rayleigh test shows non-trivial asymptotic powers in the high-dimensional setup (see 
Theorem 4.2 and the alternatives (5.1)). 

For any (n,p) E C X C, with C : = {30,100,400}, any j E {1,2}, and any £ E 
{0,1, 2, 3,4}, we generated M = 2,500 independent random samples X^j, i = 1,..., n, as 
described above, and evaluated the rejection frequencies of (i) the specified-0 n test fig 
in (3.2) and of (ii) the high-dimensional Rayleigh test j , / n ' ) in (1.2), both conducted at 
nominal level 5%. Rejection frequencies are plotted in Figures 1 and 2, for FvML and 
beta-type alternatives, respectively. In each figure, we also plot the corresponding asymp¬ 
totic powers, obtained from (3.3), (4.3), Theorem 5.2, and the fact that (j)^ is consistent 
against (j = 2)-alternatives. 

Clearly, for both simulations, rejection frequencies match extremely well the correspond¬ 
ing asymptotic powers, irrespective of the tests and types of alternatives considered (the 
only possible exception is the test (p^/' under {£ = 1 ,j = 2 )-alternatives; this, however, is 
only a consequence of the lack of continuity of the corresponding asymptotic power curves). 
Remarkably, this agreement is also reasonably good for moderate sample size n and di¬ 
mension p. Beyond validating our asymptotic results, this Monte Carlo study therefore 
also shows that these results are relevant for practical values of n and p. 


7. An application. Since the seminal paper Ledoit and Wolf (2002), one of the most 
widely considered testing problems in high-dimensional statistics is the problem of testing 
for sphericity. A possible approach to test for sphericity about a specified centre (without 
loss of generality, the origin of M p ) is to perform a test of uniformity on the sphere 5 P_1 
on “spatial signs”, that is, on the observations projected on 5 P_1 ; see, among others, 
Cai, Fan and Jiang (2013), where this is used in a possibly high-dimensional setup, and 
Cuesta-Albertos, Cuevas and Fraiman (2009), where it is argued that “in most practical 
cases the violations of sphericity will arise from the non-fulfillment of uniformity on the 
unit sphere for projected data ”. This is particularly true in the high-dimensional case, since 
the concentration-of-measure phenomenon there implies that information lie much more 
in the directions of the observations from the origin than in their distances from the origin 
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n=30 n=100 n=400 











f t t 

Fig 1. Rejection frequencies (dashed) and asymptotic powers (solid), under the null of unifor¬ 
mity over the p-dimensional unit sphere (£ = 0) and increasingly severe FvML alternatives 
(i = 1,2,?>,&), of the specified-6 n test (fig 1 ' 1 in (3.2) (red/orange) and the high-dimensional Rayleigh 
test <j/ n 'l in (1.2) (light/dark green). Light colors (orange and light green) are associated with 
contiguous alternatives, whereas dark colors (red and dark green) correspond to the more severe 
alternatives under which the Rayleigh test shows non-trivial asymptotic powers in high dimensions; 
see Section 6 for details. 
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n=30 n=100 n=400 











f t t 

Fig 2. Rejection frequencies (dashed) and asymptotic powers (solid), under the null of uniformity 
over the p-dimensional unit sphere (l = 0) and increasingly severe “beta” rotationally symmet¬ 
ric alternatives (i = 1,2,3,4/, of the specified-6 n test <f>^ in (3.2) (red/orange) and the high¬ 
dimensional Rayleigh test in (1.2) (light/dark green). Light colors (orange and light green) 
are associated with contiguous alternatives, whereas dark colors (red and dark green) correspond to 
the more severe alternatives under which the Rayleigh test shows non-trivial asymptotic powers in 
high dimensions; see Section 6 for details. 
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(incidentally, note that Juan and Prieto (2001) also invoked the same argument to adopt 
a directional approach for outlier detection in high dimensions). 

As showed in the previous sections, the high-dimensional Rayleigh test will show power 
against skewed rotationally symmetric distributions on the sphere (skewness arises from 
the nronotonicity of the corresponding nuisance /). On the contrary, the Rayleigh test will 
be blind to any non-spherical distribution in MP whose projection on the sphere charges 
antipodal regions equally. In particular, it will show no power against elliptical alternatives, 
hence also against spiked alternatives (that is, against alternatives associated with scatter 
matrices of the form £ = a (Ip + A fi/3'), with a, A > 0 and /3 E 5 P_1 ). Interestingly, most 
(if not all) tests for sphericity in high dimensions are designed to detect elliptical or spiked 
alternatives. This is the case, e.g., both for the Gaussian sphericity test (4$, say) from 
John (1972) and for the sign test of sphericity ((f>g , say) from Hallin and Paindaveine 
(2006) (these tests were shown to be valid in high-dimensions in Ledoit and Wolf (2002) 
and Zou et al. (2014)/Paindaveine and Verdebout (2015), respectively). In line with this, 
theoretical efforts have so far focused on spiked alternatives; see, e.g., Onatski, Moreira 
and Hallin (2013, 2014), where powers of various tests of sphericity under high-dimensional 
spiked alternatives were investigated. 

To illustrate these antagonistic power behaviours, we performed the following simula¬ 
tion exercise involving the Gaussian sphericity test (jyff and the sign sphericity test qbg 
(both in their version to test for sphericity about the origin of MP), as well as the high- 
dimensional Rayleigh test 4>n - For l = 0,1,2,3,4 and n = p = 100, we generated 10,000 
p-dimensional independent samples X , i = 1 ,... ,n, and xjj^, i = 1 ,..., n, from two 
different alternatives to sphericity : 

(i) X'J, i = , n form a random sample from the p -variate skew-normal distribution 

with location vector 0, scatter matrix I p and skewness vector (£,... ,£)' E MP; see 
Azzalini and Capitanio (1999); 

(ii) X).^ , i = 1 ,..., n form a random sample from the p -variate normal distribution with 
mean 0 and covariance matrix I p + £e\ ej, with ei = (1,0,..., 0)' E M p . 

For both (i)-(ii), £ = 0 is associated with the null of sphericity about the origin of M p , 
whereas l = 1,2, 3,4 provide increasingly severe alternatives. Figure 3 plots the resulting 
empirical powers of the three tests mentioned above, all performed at nominal level 5%. Re¬ 
sults confirm that the Rayleigh test performs quite well under alternatives of type (i) 
but shows no power against alternatives of type (ii), whereas the tests and qb^ do 
the exact opposite. In practice, thus, as soon as the Rayleigh test and more standard tests 
of sphericity lead to opposite rejection decisions, practitioners are offered some insight on 
what type of deviation from sphericity they are likely to be facing. 

We illustrate this on a real data example. We considered the gene expression dataset an¬ 
alyzed in Eisen et al. (1998); more precisely, we restricted to a subsample of 100 ribosomal 
proteins from this dataset. The data then take the form of a matrix X = ( Xij ), where Xij 
is the jth expression value (j = 1 ,... ,p = 79) of the zth gene (i = 1,... ,n = 100) (even 
though n > p, the present data may be considered high-dimensional since the small value 
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Skew-normal alternatives Elliptical/spiked alternatives 




l l 

Fig 3. (Left:) Rejection frequencies, under the null of sphericity in (£ = 0) and increasingly 
severe skew-normal alternatives (£ = l,2,3,4j, of the Rayleigh sphericity test (green), along 
with the more classical sphericity tests (f>^ (orange) and qbffl (red). (Right:) The corresponding 
rejection frequencies under some p-variate spiked alternatives. In both cases, the dimension p and 
the sample size n are equal to 100, the nominal level is 5%, and the number of replications is 10, 000; 
see Section 7 for details. 


of n/p prevents relying on fixed-p asymptotic results). The rows of X (the “expression 
vectors”) are obtained from DNA microarray experiments. After imputing missing data 
(by replacing any missing entry in X with the sample average of available measurements 
on the same variable) and centering the observations via the sample average, we performed 
the Rayleigh test and its competitors and <f>^. While centering still leaves some 
space for rejection by (that is indeed based on the sample average of projected ob¬ 
servations), the Rayleigh test interestingly provides a p -value above .9999, whereas those 
of cpfj- and (jr s are below .0001. Hence, at any usual nominal level, the null of sphericity 
is rejected, and the outcome of the various tests suggest that the deviation from sphericity 
is of an elliptical, or at least of a centrally symmetric, nature. This may be useful to guide 
further modelling of this gene expression dataset. 

8. Conclusions and perspectives. In this final section, we summarize the results 
of the paper and present perspectives for future research. 

8.1. Summary. We considered the problem of testing uniformity on the unit sphere 
in the low- and high-dimensional setups. Rotationally symmetric alternatives with modal 
location 0 n , concentration n n and functional parameter / were considered. We showed 
that K n ~ y/pn/n provides contiguous alternatives. For specified 6 n , a local asymptotic 
normality result was established (at the aforementioned contiguity rate), which allowed, 
both in low and high dimensions, to define locally asymptotically most powerful tests for 
the specified-#™ problem. 
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Iii practice, however, #„ may rarely be assumed to be known. In the corresponding 
unspecified-#,, problem, we showed that the Rayleigh test enjoys nice asymptotic optimal¬ 
ity properties, both in the low- and high-dimensional cases. In low dimensions, it is locally 
asymptotically maximin, irrespective of /. In high dimensions, it is locally asymptotically 
most powerful invariant in the FvML case, and a conjecture — that is strongly supported 
by a fourth-order expansion of the relevant /-based local log-likelihood ratio and by the 
computation of asymptotic powers in Section 5 — states that, provided that p n = o(n 2 ), 
this optimality holds for any / that is five times differentiable at 0. 

Our results fully characterize the cost of the possible unspecification of #„. In low 
dimensions, this cost is in terms of asymptotic powers but not in terms of rate. In high- 
dimensions, however, there is a cost in terms of rate, as optimal tests cannot detect the 
contiguous alternatives in n n ~ y/pn/n , but only the more severe alternatives in n n ~ 
Pr/ 4 /\J n - Simulation results are in remarkable agreement with our asymptotic results, 
irrespective of the relative magnitude of n and p — which materializes the robustness of 
most of our results in the rate at which p n goes to infinity with n. A real data example 
illustrated the usefulness of the high-dimensional Rayleigh test in the framework of testing 
for sphericity. 

8.2. Perspective for future research. In the distributional framework described in Sec¬ 
tion 2 , the problem of testing uniformity consists in testing the null hypothesis that the 
concentration parameter n n is equal to zero. Depending on the information at hand, the 
other parameters, namely the modal location 0 n and the infinite-dimensional parameter /, 
may be regarded as specified or unspecified. If / is specified, then the problem is of a para¬ 
metric nature and optimality quite naturally relates to the local asymptotic normality of 
the corresponding fixed-/ submodel (both the specified- and unspecified-#,, parametric 
problems can be considered). We showed that, for any sufficiently smooth / in a neigh¬ 
bourhood of the origin, the test in (3.2) and the Rayleigh test achieve the /-parametric 
efficiency bounds in the specified- and unspecified-#.,, problems, respectively. 

Since it can hardly be assumed in practice that / is known, it is more natural to adopt 
a semiparanretric point of view under which / remains unspecified. The optimality results 
stated in this paper should then be read in a semiparametric sense, under unspecified / 
in the specified-#,, problem, and under unspecified (# n , /) in the unspecified-#,, one. In all 
cases, such results are pointwise in (#„, /) and relate to the corresponding semiparametric 
efficiency bounds at (#„,/); see, e.g., Bickel et al. (1998). In the present setup, it is not 
needed to go through tangent space calculations to derive the resulting semiparametrically 
optimal tests; indeed, since the test in (3.2) and the Rayleigh test are parametrically 
optimal at any smooth / (for the specified- and unspecified-#,, problems, respectively), 
they also are semiparametrically optimal at such /. Another corollary of our results is 
that semiparametrically efficiency bounds at (#,,, /) do not depend on (#„, /) but differ in 
the specified- and unspecified-#.,, problems. 

Now, the problem of testing uniformity over the unit sphere is primarily of a nonpara- 
metric nature. Even if the distributional framework described in Section 2 is considered, 
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it is therefore valid to adopt a nonparametric point of view and to try to identify, e.g., 
minimax separation rates ; see, e.g., Ingster (2000) or, in a directional context, Fay et al. 
(2013), Lacour and Ngoc (2014) and Kim, Koo and Ngoc (2016). This approach is funda¬ 
mentally different from the semiparametric one adopted in this work. In particular, instead 
of providing pointwise results in (#,,. /), this approach aims at identifying consistency rates 
that are associated with the worst-in-f (resp., worst-in-(0 n , /)) performances that can be 
achieved in the specified-#,, (resp., unspecified-#,,,) problem. This fundamental difference 
between the semiparametric and nonparametric approaches above does not make it possi¬ 
ble to translate our results in terms of minimax separation rates. Nevertheless, preliminary 
results indicate that, at least for the specified-#,, problem, the consistency rates described 
in this paper are also minimax separation rates and that the test is “rate-optimal in 
the minimax sense” (obviously, it would be natural to consider further the unspecified-#,, 
problem). These results, however, require much work and rely on other techniques than 
those considered in the present paper, hence will be presented elsewhere. 

APPENDIX A: PROOFS FOR SECTIONS 2 TO 4 
The proofs of Theorems 2.1 and 3.1 require the following preliminary result. 


Lemma A.l. Let g : M —> M be twice differentiable at 0. Let n n be a positive sequence 
that is o(y/pff) as n -A oo. Then R n (g) := c Pn /^ 1 (l — s 2 )^ Pn ~ 3 ^ 2 g(K n s) ds = g( 0) + 

£/( 0 ) + o (£). 

Proof of Lemma A.l. First note that, if X is uniformly distributed over (hence 

is such that Var[X] = (l/p n )I Pn ), then (2.1) implies that 


(A.l) 


u Pn 
»~1 


r T i 

/ s 2 (l - s 2 )(p™-3)/ 2 ds = E[(# / X) 2 ] = #'E[XX']# = —, 

J-1 Pn 


where # £ S Pn 1 is arbitrary. Now, write 


Rn(g) = g{ 0) + c Pn / (1 - s 2 ) (p " 3)/2 (g{n n s) - g( 0) - K n sg'(0))ds. 
Letting t = n n s and using the identity (A.l) with p = p n then provides 


r 


R n {g) = 5(0) + ^ / K{t) 

Pn J-K, n 


g(t) - g{ 0) - tg'{ 0) 
t 2 


dt, 


where h n is defined through 


(^) 2 (! ~ & iPn - 3)/2 


t h n (t) = nK ' Kn ' ' - KKn ' ' , -^-I[|t| < Kn\. 

n \ / rK„ , _ ^_s_^2^(p„-3)/2 U I — n J 


It can be checked that, since K n = o(y/pn), the h n 's form an approximate 5-sequence , in 
the sense that h n (t ) dt = 1 for all n and ff_ h n (t) dt —> 1 for any e > 0. Hence, 


R-r, 


(g) = 9 (0) + ^ lim ( gW-gW-VW ) +0 (ti\ 

Pn ^0 V t z J \p n J 
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which, by using L’Hopital’s rule, yields the result. 


□ 


Proof of Theorem 2.1. In this proof, all expectations and variances are taken un- 

(n) 

der the null of uniformity Pq ' and all stochastic convergences and op’s are as n —> oo 

(n) 

under Pq . Consider then the local log-likelihood ratio 


A n := log 


dP 


( n) 

^nAru/ 


dP 


2 log 


(A.2) 


(n) 

0 i= 1 

C Pn,K n ,f 


c Pn,Kn,f f ( K nX- ni 6r 

C Pn 

n 


= n {log 


Pn 


+ Enl J + ^2 ( 1°§ f{ K ri^nfin) ~ E'n\) L n l + L n2 ; 


i =1 


throughout, we write If.k ■= (log f) k and E n k := E [£f t k(K n X. , ni 6 n )] ( E nk actually depends 
on n n , p n and /, but we simply write E nk to avoid a heavy notation). 

Lemma A.l readily yields 

1 (1 _ s 2 ){ p „- 3 )/ 2 /( ^ s) ^ 


n log Cpn ^ n ' f = - n log ( c Pn 




/-I 

.2 


(A.3) = -nlog(l + ^/''(0) + O (^V) =-^/"(0) + o( m ^). 

\ ^Pn 'Pn ' / * P n ^ Pn ' 


Similarly, for any positive integer k, 
(A.4) E n k = c Pn f 


L (1 _ s 2 )(Pn — 3 )/ 2 ^ M ds = 0 ) + o(^). 

1 2p„ \p n J 


Combining (A.3) and (A. 4), and using the identity 1"^ j (0) = f"( 0) — 1 readily yields 


Lnl — 


nnz 


Turning to L n 2 , write 


-r(o)+<!,,(ii))+.(^) 


n/v„ 


+ o 


nKt 


Pn 


An2 — \JEP n ^ ^ Wni '■ — \/R^n ^ ^ 


2 pn 

log/(KnXC0„) - E n \ 


i=l i— 1 

where we let E n := Var [ log/(RnX^ ■(?„)] • First note that (A.4) provides 


(A.5) nV n = n(E n2 - E 2 nl ) = ”^^ 2 (0) + o(^) 

APn ' Pn ' 

which leads to 




Pn 


+ O 


/ riK. 


t 


Pn 


(A. 6 ) 


A,, = - 


2 p„ 


+ 




Pn 


+ O 




2=1 


Pn 


Since lT n j, i = 1,... ,n are mutually independent with mean zero and variance 1/n, we 
obtain that 


(A.7) E[AJ] = (E [A „]) 2 + Var [A„] = ^ + o(^) 

^rn V Jrn / 


'n 2 ni - 


nR„ 

+ —- + 0 
Pn 


nut 


Pn 
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If = o(y-), then (A.7) implies that exp(A n ) —> 1 , so that Le Cam’s first lemma yields 
that P g^ Kn j and Pq"' 1 are mutually contiguous. 

We may therefore assume that n 2 n = T 2 p n /n, where the positive sequence (r n ) is 0(1) 
but not o(l). In this case, (A. 6 ) rewrites A n = — \ + o(l) )T )” =1 W n i + o(l). 

Applying the Cauchy-Schwarz inequality and the Chebychev inequality, then using (A.4) 
and (A. 5), provides that, for some positive constant C, 


> £]] < n^E[W^}P[\W ni \>s) < ^E[W^y a r[W ni } 



< 



= o(l), 


where we have used the fact that P) 4 (0) = 0. This shows that ^)” =1 W n i satisfies the clas¬ 
sical Levy-Lindeberg condition, hence is asymptotically standard normal (as already men¬ 
tioned, W n i, i = 1, • • • j n are mutually independent with mean zero and variance 1/n). For 
any subsequence (exp(A nm )) converging in distribution, we must then have exp(A nm ) —> 
exp(Y’), with Y ~ W(-| lim^oo r 2 m , lim^oo T% m ). Mutual contiguity and P^ n) 

then follows from the fact that P[exp(Y) = 0] = 0 and E[exp(E)] = 1. □ 


Proof of Theorem 3.1. As in the proof of Theorem 2.1, all expectations and variances 

(n) 

in this proof are taken under the null of uniformity Pq and all stochastic convergences 

(n) 

and op’s are as n — > oo under Pq . The central limit theorem then directly establishes 
Part (ii) of the result, since E[A^] = 0 and Var[A^] = ^Var [ XO0 n ] = 1. 

It therefore remains to establish Part (i). Recall that, in the case where (r n ) is 0(1) but 
not o(l), we have obtained in the proof of Theorem 2.1 that 

2 n 2 n 

A n =- 2 " \Z r n + °(1) W n i + o(l) =-y + T n W n i + Op(l), 

2—1 2—1 


where ^ 7=1 Wm = (l/\/nVn) X)jLi(log f( K nX' ni O n ) — E n 1 ) is asymptotically standard nor¬ 
mal. To establish the result, it is therefore sufficient to show that r n [(^)" =1 W n j) — A^ 
converges to zero in quadratic mean. To do so, write 


Tn(Yl W ™) 

2—1 


r A (n) - 


M 

J2 (log/(« n XO0n) - E n 1 - ^VnXJn) ='■ ~^=- 

i =1 VnVn 


Then using EfXP^n] = 0 and E[(X( ll 0 n ) 2 ] = 1 /p n , we obtain 

E [M 2 \ = nr 2 E [(log /(« n X' ni 0 n ) - E nl - ^V n Ki^nf] 

= nr 2 (2V n - 2^/pnVn E[XP0 n (log f(K n X' ni 0 n ) - E nl )]) 
= 2riT 2 Vn - 2T n n 3/2 y/VnE[K n X.' ni 0 n log f(K n X.' ni 0n)\, 
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which, letting g(x) := x(log f{x)), provides 


(A. 8 ) 


E 


'y ^ r n A ( 


i =1 


4 n) 

& n 


2i 


_ 9 2 2r « n 

^ X^Vn 


E[g(KnX ni 0r 


Using Lemma A.l, 


E[g( Kn KA)\ = c Pn I" (1 - s 2 )^-^ 2 g( Kn s) ds = ^g"( 0) + o(^) = ^ + o(^) . 

J — 1 A Pn ' Pn ' Pn ' Pn ' 

Plugging in (A. 8 ) and using (A.5) then yields 


E 


T~n ( ^ ^ Wni ) U? A 


4 n) 

t'n 


2 i 


o_2 + °U" 

= 2r„- 


nit?. . / nre£ 

Pn l Pn 


W = o(1) ’ 


i= 1 

as was to be showed. □ 

Proof of Theorem 4.2. The FvML version of the log-likelihood in (4.1) rewrites 

( A ‘ 9 ) A n,/ Fv ML =nl °g 

where 


‘ T " - " + logE[expKnX;U)|X„] =: L nl + L n2 , 

C Pn 


r(^y)Tpn_i(fi; n ) 

L n i = -nlog—-- - £a _ — =: -nlogHp^^Kn) 

(«n/2) 2 2 

(see ( 2 . 1 )-( 2 . 2 ) for explicit expressions of c p and c PjK j F vML = CpX ML > respectively) and 


Ln 2 = log E 


I II v | U, X»HV 

exp ( nK n \\jL n \\ - „ ) |X ri 


= log ( c Pn / (1 - s 2 ) (p " 3)/2 exp(n/c n ||X n ||s) ds 


i-i 


= log 




.FvML 


= logiLpn._ 1 (n/c n ||X n ||) =: logHen.-i(K n VnT n ). 


Pn ,TlKn || 

Now, using the bounds S u+ i ^, 3 ( 4 ) < log-fZ^/c) < ( K ) (see (5) in Hornika and 

Grrin, 2014) with S a ^ s(k) := \Jk 2 + (5 2 — a\og(a + \Jn 2 + j3 2 ) — (3 + adog(a + /3), one 
easily obtains that, if n n = r n p, 3 / 4 /-^/n, with n,p n —> oo and (r n ) bounded, 


(A.10) L n \ + HKn 


nn2 


2 Pn 4p 2 (p n + 2) 


A T nK lrr2 , n2fi n ^4 m 

= 0 (1) and L„ 2 -—T« + w —^T„=o p(D 


under Pq 9 . The hrst (resp., second) result in (A. 10) requires expanding the log term in 
the S at p bounds as log x = x — 1 — ^(x — l ) 2 (resp., logx = x — 1 — \ (x — l ) 2 + ^z(x — l) 3 ) 
for some c between x and 1, and the second one further requires using the fact that T n = 
l + op(l) as n —> oo under Pq 9 (which directly follows from Theorem 1.1). Plugging (A. 10) 
into (A.9) and using again the fact that T n = 1 + op(l) entails that, as n —> oo under Pq "' 1 , 


UkI UK 2 n 2 n 2 K i n 4 _2 R n 

+ x— U. - —-— 4 + op(l) = T n—^~ 


n,/ FvM L 2 p n 1 2 p n ^ n 4p2(p n + 2) 

Jointly with Theorem 1.1, this establishes the result. 


\/2 4 (p n + 2) 


+ o P (l). 


□ 
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APPENDIX B: PROOFS FOR SECTION 5 

In this second appendix, we establish Proposition 5.1 and Theorems 5.1-5.2. 

B.l. Preliminary lemmas and proof of Proposition 5.1. Define the quantities 
u n i '■= XC0 n and v n i := (1 — rtjb) 1 / 2 that are associated with the tangent-normal decom¬ 
position X-ni = U n i6n~\~ 'Oni^ni of X n j, where - = (X n j CX. n jOn)@n) / 1| X n j (^X. n j6 n ')0 n || 
if X n j / 6 n and 0 otherwise. With this notation, e n i = E[tt^J and f n £ = Efu^J (see 
Proposition 5.1). We start with the following lemmas. 

Lemma B.l. UnderP^, (i) E[S ni SC] = for anyi; (ii) E[(SCS ni ) 2 ] = 

for any i / j; (in) E[(SOS ni ) 4 ] = ^=r for any i ^ j. 

Proof, (i) Let O be a p n x p n orthogonal matrix such that O 6 n = ei, where ei 
denotes the first vector of the canonical basis of M Pn . Then the random vectors OS n j, 
i = l,...,n form a random sample from the uniform distribution over {x E S Pn ~ l : 
e'ix = 0}. Consequently, 0E[S n jSC]0' = ^r[(lp n — eje)), which yields the result. 
(ii)-(iii) It follows from the joint distribution of the OS„j’s just derived that, for any i ^ j , 
S'niSnj = (OS ni y(OS nj ) is equal in distribution to U'V, where the independent random 
( p n — l)-vectors U, V are uniformly distributed over S Pn ~ 2 . The result then follows from 
Lemma A.l(iii) in Paindaveine and Verdebout (2015). □ 

Lemma B.2. Under P^, (i) E[X' ni X nj ] = e 2 nl for anyi / j, (ii) E[(X / n -X nj ) 2 ] = e 2 2 + 
fli/(pn - 1) for any i / j, (in) E[(XCX nfc )(X' n£ X„ i )] = e n2 e 2 nl for any i / j and k^i 
such that { i,j , k, U) contains exactly three different indices, and (iv) E[(XbX n j) (X^X^)] = 
e ni ifi,j,k,£ are pairwise different. 

Proof. Part 1 of the lemma directly follows from X' m X„j = ( u n iO n + v n iS n i )'( u n jO n + 
VnjSnj) = UniUnj + v n iV n jS' ni S n j . For the remaining claims, note that, for i < j and k < £, 

(B.l) E[(X ni X nj -)(X nfe X n ^)] — E \u n iu n ju n ku n £ ] T E[v n iV n jv n i,v n f\ E[(S r , ? -S 7 ) y)(S 7 ) ^S /? ^)]. 

Part 2 of the result then follows from Lemma B.l(ii). For Parts 3-4, there is always one 
of the indices i,j,k,£ that is different from the other three indices, which implies that 
E[(SCS„j)(Sj lft S n ^)] = 0. The result readily follows. □ 

Lemma B.2 allows to prove Proposition 5.1. 

Proof of Proposition 5.1. Since the expectation readily follows from Lemma B.2(i), 
we can focus on the variance. Using Lemma B.2(i) again, we obtain 

Var Fn [ Rf l t } = 2 ^ J2 E (E[(XCX nj )(X; fc X n ,)]-e 4 1 ). 

I <i<j<n 1 <k<l<n 

In this sum, there are ( 2 ) terms corresponding to Lemma B.2(ii) and 6(2) terms (not 
contributing to the sum) corresponding to Lemma B.2(iv). Therefore, there are (^) 2 — 
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( 2 ) — 6 ( 4 ) = n ( n — l)( n — 2) terms corresponding to Lemma B.2(iii). Consequently, 

Var Fn [Rf] = ^ | Q ( e n 2 + fwitfPn ~ 1) “ 4l) + n ( n ~ l ){ n ~ 2 ) ( e n2 4l ~ 4l) j 

= ^ |( e n2 - 4l) + 2 (n - 2 )4l ( e n2 ~ 4l) + fn'j/iPn ~ 1 )|, 

which, since e n 2 = e n 2 — 4 i> establishes the result. □ 

Both following lemmas are needed to prove Theorem 5.1. 

Lemma B.3. Under P^, (i) E [(X ni - e n \0 n ) (X ni - e n \9 n )'] = e n 20 n 0' n + (I Pn - 

Ml); (n) Var [(X' ni 0 n - e nl ) (X.' nj 0 n - e nl )} = e n4 - e ^2 f or i = 3 and 4,2 f or i / 3! (™) 
E[XC(I Pn - 0 n 0' n )X nj \ = for i = j and 0 for i ^ j; (iv) Var[XC(I Pn - O n 0' n )X nj \ = 
fni ~ fn 2 for i = j and f 2 2 /(p n - 1 ) for i ± j . 

Proof, (i) Using the tangent-normal decomposition and Lemma B.l(i), we obtain 

E[(X n 2 G-nlOn)(^X-ni ^nl^n) ] = E [((^77,2 €-nl)@n Vni^ni)( < (V / ni ^nl)^n d~ Vni^ni) \ 

= E [{um - e nl ) 2 ]0 n 0' n + fra E[s ni s^] = e n 20 n 0' n + -^-(I Pn ~ M'J- 

Pn 1 

(ii)-(iv) The results readily follow from the fact that X! ni 0 n — e n i = u n i — e n \ and XC(I Pn — 
0 n 0' n )X n j = v n iV n jS' ni S n j (and from Lemma B.l(ii)). □ 

Lemma B.4. Consider expectations of the form Cij rs = E [A^A^A^A^] taken un¬ 
der P^, with An := (X- n i — e n \0 n )' (X n £ — e n {0 n ) and i<j<r<s<l. Then 

(i) Cijrs = 4,4 + ( E [«nt (“ni “ e nl) 2 ]) 2 + ^ if i = j = r = 8; (U) C ijrs = 
42 g n4 + 2e ;:lf V[v 2 ni (um - e n i) 2 ] + [pfjpp. if i = j < r = s; (Hi) c ijrs = 0 otherwise. 

Proof. We start with the proof of (iii). Assume that j = r, so that we are not in 
case (ii). Since case (i) is excluded, we have i < j or r < s. In both cases, one of the 
four indices i,j,r,s is different from the other three indices. Since E[Ajf] = 0, we obtain 
that Cij rs = 0, which establishes (iii). Turning to the proof of (i)-(ii), we use the tangent- 
normal decomposition again to write A j£ as (u nj — e n i)(u n i — e n i) + v n jV n g(S' n jS n (). Since 
E[(STS n ^) fc ] = 0 for any odd integer k, this leads to decomposing Cjj rr into 

Cjjrr — E [(Unj e n { ) (u nr Cnl)" (u n // e n \) j 

+4E [('Unj e n \)(Unr e n l)(u n £ Cnl) 'O n jVnrV n g(S n jS n (>)(S nr S n c')^ 

+2E [(tt nr — e n i) (u n g — e n \) v n jV n g(S n jS n () ] + E ^v n jV nr v n ^(S n jS n £) (S nr S n £) ] . 


The result then follows from Lemma B. 1 (ii)-(iii). 


□ 
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B.2. Proofs of Theorems 5.1 and 5.2. The proof of Theorem 5.1 is based on the 
following central limit theorem for martingale differences. 


Theorem B.l (Billingsley 1995, Theorem 35.12). Let D n £, £ = 1,..., n, n = 1,2,..., 
be a triangular array of random variables such that, for any n, D n \, D n 2 ,..., D nn is a 
martingale difference sequence with respect to some filtration 5F n \^£Fn2^ ■ ■ ■ ,T nn . Assume 
that, for any n,l, D n e has a finite variance. Letting <r^ = E [.D^ | fF n ,t- 1 ] (with being 
the trivial a-algebra {0, fl} for all n), further assume that, as n —> oo, 

n n 

(B.2) E a rd = 1 + °p(1) an d E^044UA^| > e]] — > 0. 

1= l t=\ 

Then D n i is asymptotically standard normal. 


Writing E n £ for the conditional expectation with respect to the cr-algebra 5F n i generated 
by X n \, ..., X n £, we have 

n]= — \ E ( X n* X "J - 4l) + (n - i)enl E (KiOn 

n <i<j<e i =i J 

Note that Rff = YJi=i D m, where D nl := E ni [I?,)) 1 ] - E n ,t-i [Rff] rewrites 

(B.3) D nt = EE / y^(X ni - e nl 0 n ) + (n - l)e n iO n X (X nl - e nl 0 n ) , £ = 1,2,... 
na n l J 

(throughout, sums over empty set of indices are defined as being equal to zero). The 
following lemmas then establish the conditions (B.2) in the present context. 

Lemma B.5. Let the assumptions of Theorem 5.1 hold. Then, under P^, (i) 4*40 

converges to one as n —> oo, and (ii) Var[)P ” =1 o( xj f\ converges to zero as n —> oo. 

(n) 

Lemma B. 6 . Let the assumptions of Theorem 5.1 hold and fixe > 0. Then, under P y F , 
YJl =l E[(D ne ) 2 l[\D n f\ > e]] -a 0 as n -> oo. 

In the rest of the paper, C is a positive constant that may change from line to line. 
Proof of Lemma B.5. (i) Note that 



2 _ 2 P 

G ni ~ 


n 2 a 2 


l -1 


1=1 


B x ni Cnl^n) ~X~ j E £nl@n) i^-ni ^nl^n) 


i-1 


E ( x nj €nl@n) 
0 = 1 
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By using Lemma B.3(i), we obtain 


2 _ 
a nt ~ 


‘-Pn^n‘2 

n 2 crl 


(B.4) 


f t-i 


l-\ 


■j 'y ] O^-ni^n e n i)(X nj -0 ri e n i) + 2(n l)eni e n \)(n 1) e nl 

l i,j =l 


2=1 


+ 


-Pn fn 2 


e-i 


Therefore 

(B.5) E[^] = 


(Pn - l)^ 2 ^ ^ 

b lJ -- 1 - 

2pn^n2 




n 2 cr 2 


(£ — l)e n 2 + 0 + (n — 1) e nl > + 


2Pn(l- l)/n2 

(Pi* - l)« 2 ^n ’ 


where we have used Lemma B.3(iii). This yields 


(B.6) 


s 2 ■= 

d n * 


: =E E ^] 


(n - l)p n e 2 2 2p n e n2 


+ 


i=i 


no* 


not 


'-(n~ 1) e nl + 


2 2 . ( n - l )'Pnfn2 


C Pn ~ l ) n °l 


as n -A oo, as was to be shown. 

(ii) From (B.4), we obtain Var[^” =1 <t 2 J < C(Var[A„] + Var[.B n ] + Var \C n ^), where 

A ._ Pn<2 n 2 \T^-T /-y/ /3 „ WV/ /5 _ \ R_ Pree T iie n 2 1 CY - ' P 

A n ■— n 2 a 2 L<=1 Lij=l(^n”n 6 nl J (X nj - 0 n e n \), B n n(T 2 2^=1 Z^i=l 

e n i) and C n := )T"=i Ej=i XT(I Pn - 9 n 9' n )'X n j. We establish the result by showing 

that, under the assumptions considered, Var[A n ], VarfB^] and VarfC^] all are o(l) as n —>• 
cx). We start with A n , which we split into 

~ n t—\ o ^ n 

An = ^ E HiKiOn ~ enl ) 2 + E E ^nfin ~ e n l)(KA ~ ^ 


n 2 o 2 


Vn^n2 

n 2 o 2 


1=1 i= l 
72—1 


£= 1 l<2<j<^-l 
\2 ^Pn^n2 


E ( n “ '0( X n^n - e n i) 2 + 2 if E ( n - j)( X nA ~ e ni)( X n/n - e n i), 

2=1 n l<i<J<72—1 


that is, into + A®, say. Clearly, 


2 ~2 72—1 

Var[A^] = E( n " *) 2 Var [( x '™ 0 n - e n i) s 

71 a 72 


2=1 


q Pn&n , 2(^4 6 ^. 2 ) ^ Pn&n 2 (^4 ^ 2 ) 


no 1 


n{p n el 2 ) 2 


e-ni 1 
n 


ne 


n 2 


which, by assumption, is o(l) as n —> oo. Since (X.' ni 9 n — e n \){K! n -9 n — e n i), i < j, and 
(K' nk 9 n — e n i)(X! ng 9 n — e n i), k < £, are uncorrelated as soon as (i,j) / ( k ,£), we obtain 


Var W» 2) ] = 


4Pn.e 


E (n-j) 2 Var[(XT0„-e nl )(X;/„-e nl )] <C 

l<2<Jf<72—1 


n A oi 


P 2 n e A n 2 


In view of the majorations 


,, 2 

Pn e n2 


0 


4 — 


< c 


T) 2 f ^ 

Pn c n 2 


(2np n ef ll e n2 )'^ 


= c(^f and ^<C(^) 2 . 

a n v J n 2 
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Var [A„ ], by assumption, is o(l) as n -» oo. Therefore, Var[A n ] is indeed o(l) as n —> oo. 
Turning to B n , 


2 2 ~2 n —1 

Var [B n ] = Pn %&* Var [ ^(n-z) (X.' ni O n -e nl ) 


n 2 cr 4 


2=1 


9 o ~9 22— 1 9 9 ~Q 

Pn<l<2 V~'/ -\2r / ^ n Pn e nl e n2 

2_^[n—i) e n ,2 S G- 


n 2 cr 4 

n i=i 




which is o(l) as n —> oo since it can be upper-bounded by 

nple 2 nl e 3 n2 r »P%4i g n2 = r ^2 

(2np ri e 2 1 e n2 ) 2 ne 2 , Y np^e^f 2 , ' / 2 2 

Finally, we consider C n . Proceeding as for A n , we split C n into 

(. Tl t— 1 0 72 

= X ni(!pn - M'jX nl + ^| E E X n.(!pn - Mi)X 

f=l 4=1 n ^=1 


n 2 <7 2 




fn2 

ri 2 ol 


72—1 


E(n-i)XG(I Pn -M;jX ni + 


2 — 1 


2,/ n2 

n 2 <r 2 


E (n-j)X^(I Pn -M'jX nj , 

1<2< < 7<72—1 


that is, into Cn^ + C^f 1 , say. Clearly, 

Var [C#>] = J^E (n -*) 2 Var[XG(I p „-M;jX ra ] < C f ^ {fn ^~ / " a) < C fni ~ f /" 2 , 


2—1 


so that Va.r[C'n ] is o(l) as n -» oo. Since XC(I Pn - 6 n 0' n )X n j, i < j, and X' nfc (I Pn - 
0 n 0' n )X. n £, k < l, are uncorrelated as soon as (i, j) / (&,£), we obtain 


V ”[ C » ) ] = ^ E (n-j) ! V a r[X'„ i (I R ,-»X)X^] <C^ 


•4 

7l2 


c 
< — 


<^(Pn - 1 ) Pn 


n l<i<j<n-l 

Therefore, Var[C n ] is also o(l) as n — > oo, which establishes the result. □ 

Proof of Lemma B.6. the Cauchy-Schwarz and Chebychev inequalities yield 

72 72 _ 71 _ 

(B.7) E E >^]]<E ^ E \ D niM\DnA >e]<-E ^ E KJVar[^]. 

f=i f=i fc £=i 

Recalling that cr 2 ^ = E[D 2 ^ | T n $- 1 ], (B.5) provides 

Var[T) n f] < E[D^] = E[cr 2 ^] < (e+ ne^e^ + — T" 

77-(J n y p n 1 / 72 

Using (B.3) and the inequalities (o + 6) 4 < 8(a 4 + 6 4 ) and a 2 > 2np n e^ ll e n 2 then yields 


E [Die] < -51 IE 


n 4 cr 4 


g-i 


^ ^ (-^-722 ^72l^7l) i^-n£ ^Tll^Tl) J ~\~Tl C n ^E [(X. n gO n ^nl) 


(B.8) < -$^E 


n 4 cr 4 


2=1 

-1 


^ ^ (Xm Cnl^n) {X-n£ ^n\^r 


2=1 


+ 


Cc n 4. 

2z2 ’ 


Tire 


7l2 
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Applying Lemma B.4, we have 
• i -1 


E 


( (X ni -e nl 0 n y(X n£ -e n i0n)) 1 = (£-1) (e 2 4 + —E [v 2 ni {u ni - e nl ) 2 } 2 + 3/l 
\ i=1 J J \ Pn 1 

+ 3(< - 1)« - 2) - e nl f] + 


n4 


Pn - 1 


By Cauchy-Schwarz, this yields 
■ t -1 


Pn 


E 


E< x m &nl0n) (X n g 6 n i0 ri 


i =1 


4n 


1 3 

< (^n g n4 + 6p n /n4e n 4 + 3/^ 4 ) + 


n 3 cd 


n 2 cr 4 


Pn&n2&n4 + ^Pn^fn2 fl 4 ^ 


< 


c ( &n4 fn,4&n4 / n 4^ _|_ ^ ( ^n4 ^ ( /n4Cn4^ 3 /" /n4 

fn2^n2 fn 2 


_ + f2 g2 + 

"n2 Jn2^n2 


+ 


- n3 \ 34 ' f 2 *2 ■ f4 / ■ n 2 V g 2 - V / 2 e 2 J /, 2 


n J \e, 

Plugging into (B.8), we conclude that 

n4 , fn4&n4 f n . 

4 + f2 z2 + f 4 
n2 «/ n2 e n2 «/n2 


+ 


+ 


n2 


E 


|-p4 I ^ C ( e n4 fnA&nA /n4^\ , C (&nA f /n4Cn4\ 3 / 2 /n4 \ ^ C / C n4 /n4 \ 

L "G -^3l^r+ 72 p2 ' ~e4~ j + ^2 i g2“ + V f 2 P 2 / + 72“ I - “ l E^2~ + ZT7T ) > 

n \ e n2 Jn2 e n2 J n2 J 11 \ e n2 v /n2 e n2 / ■hi2 / n \ ne n2 rl Jn2 / 


which, by assumption, is o(l/n) as n A oo. 

All majorations and o’s above being uniform in we hnally obtain that 

E < c (”,“„e[ i 4]) 1/ ’ -► 0 

£=i 

as n —> oo, which, in view of (B.7), establishes the result. 

Proof of Theorem 5.2. From Theorem 5.1, we have that, as b A oo, 
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where we used Lemma 2.11 from van der Vaart (1998). 
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SUPPLEMENTARY MATERIAL 

Supplement to “Testing Uniformity on High-Dimensional Spheres against 
Rotationally Symmetric Alternatives” 

(doi: completed by the typesetter; .pdf). In this supplementary article, we derive the fixed-p 
asymptotic non-null distribution of the Rayleigh test statistic in (3.4), and we show that, 
under FvML distributions, the conditions (i)-(iii) of Theorem 5.1 always hold. 
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